HopsFS: Scaling Hierarchical File System Metadata Using NewSQL Databases

Authors

  • Salman Niazi
  • Mahmoud Ismail
  • Seif Haridi
  • Jim Dowling
  • Steffen Grohsschmiedt
  • Mikael Ronström
Abstract

Recent improvements in both the performance and scalability of shared-nothing, transactional, in-memory NewSQL databases have reopened the research question of whether distributed metadata for hierarchical file systems can be managed using commodity databases. In this paper, we introduce HopsFS, a next-generation distribution of the Hadoop Distributed File System (HDFS) that replaces HDFS' single-node in-memory metadata service with a distributed metadata service built on a NewSQL database. By removing the metadata bottleneck, HopsFS enables clusters that are an order of magnitude larger and have higher throughput than HDFS clusters. Metadata capacity has been increased to at least 37 times HDFS' capacity, and in experiments based on a workload trace from Spotify, we show that HopsFS supports 16 to 37 times the throughput of Apache HDFS. HopsFS also has lower latency for many concurrent clients and no downtime during failover. Finally, as metadata is now stored in a commodity database, it can be safely extended and easily exported to external systems for online analysis and free-text search.

Similar resources

A Metadata-Rich File System

Despite continual improvements in the performance and reliability of large scale file systems, the management of file system metadata has changed little in the past decade. The mismatch between the size and complexity of large scale data stores and their ability to organize and query their metadata has led to a de facto standard in which raw data is stored in traditional file systems, while rel...

Design and Implementation of a Metadata-Rich File System

Despite continual improvements in the performance and reliability of large scale file systems, the management of user-defined file system metadata has changed little in the past decade. The mismatch between the size and complexity of large scale data stores and their ability to organize and query their metadata has led to a de facto standard in which raw data is stored in traditional file syste...

NewSQL: Towards Next-Generation Scalable RDBMS for Online Transaction Processing (OLTP) for Big Data Management

One of the key advances in resolving the “big-data” problem has been the emergence of an alternative database technology. Today, classic RDBMS are complemented by a rich set of alternative Data Management Systems (DMS) specially designed to handle the volume, variety, velocity and variability of Big Data collections; these DMS include NoSQL, NewSQL and Search-based systems. NewSQL is a class of...

Extended File System Metadata Management with Relational Databases

Modern file systems need to handle extended metadata. Existing file systems are not equipped to manage metadata for the number and diversity of files that they now support. Users need better searching and querying capabilities. Metadata within files still remains application- and file format-specific and is often proprietary, which makes searching difficul...

NASA Earth Observing System Data and Information System Implementation of Extra- and Inter- File Metadata: Application to Data Services

NASA's Earth Observing System (EOS) Data and Information System Core System (ECS) will become a very large repository for earth sciences data, which will include instrument measurements, derived model products, ancillary data from sources other than EOS, and so on. In addition to data archival functions, the system will also produce and distribute data. The needs of a heterogeneous community fo...

Publication date: 2017